Crispo - Excel Challenge 08 2026

excel-challenges

weekly-exercises

Easy Sunday Excel Challenge

Published

February 22, 2026

Illustration for Crispo - Excel Challenge 08 2026

Challenge Description

Easy Sunday Excel Challenge

⭐ ⭐Count Complete and OK Invoices

Solutions

library(tidyverse)
library(readxl)

path <- "2026-02-22/Challenge 104.xlsx"
input <- read_excel(path, sheet = "Sheet3", range = "B3:O24")
test <- read_excel(path, sheet = "Sheet3", range = "Q8:X12")


result = input %>%
  filter(rowSums(!is.na(.)) > 1) %>%
  select(where(~ any(!is.na(.)))) %>%
  janitor::row_to_names(row_number = 1) %>%
  janitor::clean_names() %>%
  fill(!starts_with("na"), .direction = "down") %>%
  fill(na_3, .direction = "down") %>%
  rename(
    Course = na,
    Hours = na_2,
    Session = na_3
  ) %>%
  select(where(~ any(!is.na(.)))) %>%
  fill(everything(), .direction = "up") %>%
  distinct() %>%
  mutate(
    date = janitor::excel_numeric_to_date(as.numeric(date)) %>% as.POSIXct(),
    cost = as.numeric(cost)
  )

names(result) = names(test)
all.equal(result, test)
#> [1] TRUE

Logic:
- Reads the workbook range needed for the challenge
- Builds the intermediate helper columns that drive the final answer
Strengths:
- The R solution stays compact and mirrors the workbook logic closely.
Areas for Improvement:
- The code assumes the workbook layout and named ranges remain stable.
Gem:
- The best part of the solution is choosing a tidy intermediate shape before producing the final answer.

import pandas as pd
import numpy as np

path = "2026-02-22/Challenge 104.xlsx"
input = pd.read_excel(path, sheet_name="Sheet3", usecols="B:O", skiprows=2, nrows=22)
test = pd.read_excel(path, sheet_name="Sheet3", usecols="Q:X", skiprows=7, nrows=4)

result = input[input.apply(lambda x: x.count(), axis=1) > 1]
result.columns = result.iloc[0]
result = result[1:].reset_index(drop=True)
result = result.dropna(axis=1, how='all')
result = result.groupby(result.index // 2).agg(lambda x: ' '.join(x.dropna().astype(str)))
result.columns = test.columns
for col in ['Hours', 'Sessions', 'Cost']:
    if col in result.columns:
        nums = pd.to_numeric(result[col].astype(str).str.extract(r'(-?\d+\.?\d*)')[0], errors='coerce')
        nums = nums.round(0)           
        result[col] = nums.astype('int64')
if 'Date' in result.columns:
            result['Date'] = pd.to_datetime(result['Date'], errors='coerce')
print(result.equals(test))

Logic:
- Reads the workbook range needed for the challenge
- Aggregates or ranks values at the correct grouping level
- Applies the rule iteratively until the output is complete
Strengths:
- The Python version keeps the same rule in a direct pandas-oriented workflow.
Areas for Improvement:
- As with the R version, any workbook layout change would require small adjustments.
Gem:
- The implementation stays close to the stated challenge instead of adding unnecessary complexity.

Difficulty Level

This task is easy to moderate:

The business rule is readable, but the workbook still needs a few careful transformation steps.

Challenge Description

Solutions

Difficulty Level

Related Challenges